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(71) We, International Business 
Machines Corporation, a Corporation or- 
ganized and existing under the laws of the 
State of New York in the United States of 
America, of Armonk, New York 10504, 
United States of America, do hereby declare 
the invention, for which we pray that a 
patent may be granted to us, and the method 
by which it is to be performed, to be par- 
ticularly described in and by the following 
statement: — 

This invention relates to an interactive 
enquiry system having a distributed data 
base and which may be used, for example, 
in a seat reservation and/or ticketing sys- 
tem. 

Computerized reservation and ticketing 
systems in the past have comprised a central 
processor used to control a central data 
base. A number of remote terminals, nor- 
mally consisting of a keyboard and display 
or printer, were connected to the central 
processor. Whenever the ticketing/reserva- 
tion clerk wished to make an enquiry or con- 
duct a transaction it was necessary for a 
connection to be established with the central 
data base. 

Such an arrangement sutlers from three 
disadvantages. Firstly, the cost of establish- 
ing the connection between the remote ter- 
minal and the host computer is not incon- 
siderable and is increasing all the time. 
Secondly, the time required to establish the 
connection adds to the response time of the 
clerk to answer a query or complete a trans- 
action. Thirdly, the whole system is depen- 
dent upon the reliability of the communica- 
tion links. 

The Complete Specification of our co- 
pending Application for Letters Patent! No. 
16749/74 (Serial No. 1.437,883) describes a 
ticketing system in which a number of local 



data bases are established so that trans- 
actions can be completed without the need 
for accessing the central data base for every 
transaction. Such an approach clearly re- 
quires local storage for the local data base 
and a local processing capabilty for the con- 
trol of that! local data base and the terminals 
connected to it. Therefore when considering 
the relative costs between a centralized data 
base system and a distributed data base sys- 
tem, some balance must be struck between 
the costs of local storage and the costs of 
communication. All other things being 
equal, faster response times represent re- 
duced costs due to increased productivity 
on the part of the ticketing? reservation 
clerk. 

The present invention is concerned with a 
distributed data base system employing 
storage management arrangements which 
tend to reduce the need for excessively large 
amounts of local storage. 

According to the present invention, an 
interactive enquiry system comprises a host 
data processor, a central data store con- 
trolled by the host processor for storing a 
data base, a plurality of local sub-systems 
connectible to the host processor and each 
including a local data processor for con- 
trolling access to the host processor and a 
local data store connected to the local pro- 
cessor for storing a copy of part only of the 
data base, at least one enquiry terminal in 
each subsystem connected to the local pro- 
cessor for accessing any accessible item in 
the data base stored in the central store, 
and means in each subsystem for creating 
space in the local store by deleting from the 
local store those items in the data base least 
frequently requested by the terminal or ter- 
minals in that particular subsystem, where- 
by in operation the most popular items re- 
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quested in a particular subsystem may be 
accessed without accessing the host pro- 
cessor. 

The invention will now be particularly des- 
5 cribed, by way of example, with reference to 
the accompanying drawings, in which: — 

Figure 1 is a schematic of a ticketing sys- 
tem employing a distributed data base, 

Figure 2 illustrates the structure of the 
10 data base, 

Figure 3 illustrates the configuration of a 
local subsystem, 

Figure 4 illustrates an example of a ticket- 
ing clerk's display, 
15 Figure 5 is a schematic showing the dis- 
tribution of the data base, 

Figure 6 is a flow chart illustrating a first 
method of accessing a page in the data base, 

Figure 7 illustrates how pages can be 
20 stored in a random access memory, 

Figure 8 illustrates how data base pages 
may be stored on a magnetic disc file, 

Figure 9 is a flow chart illustrating a 
second method of accessing a page in the 
25 data base, 

Figure 10 illustrates an alternative method 
of storing pages in the random access mem- 
ory, 

Figure 11 illustrates an alternative method 
30 of storing data base pages on the magnetic 
disc file, 

Figure 12 shows the format for a direc- 
tory entry for the method shown in Figure 

35 Figure 13 shows the format for directory 
entries for the method shown in Figure 11, 
and 

Figure 14 shows an example of typical 
directory entries of the type shown in Figure 
40 13. 

Referring now to Figure 1, a central host 
processor 1, for example an IBM (Registered 
Trade Mark) system 370/model 145 com- 
puter, has a central data base 2 containing 

45 a complete ticketing data base. Data base 
2 may be stored, for example, on IBM 
(Registered Trade Mark) 3330 or 3340 disc 
files although any other bulk storage device 
could be used. 

50 Connected to the host processor 1 through 
communication links 3 are local ticketing 
subsystems 4 which are located at each ticket 
issuing station in the network. The network 
mighrbe constituted by a complete railway 

55 network. To allow tickets for other railway 
systems to be sold, host processor 1 may be 
connected through a communication link 5 
to other processors 6 which in turn contain 
the data bases associated with their own 

60 network. 

Each local subsystem comprises a con- 
troller or local processor 7, for example an 
IBM (Registered Trade Mark) 3773 Model 
2 Controller which contains an arithmetic 

65 logic unit 8, semiconductor read only stor- 



age and semiconductor random access 
memory 9, a* communications adapter 10 
for connecting the controller to the remote 
host 1, input/ output adapters 11, and a 
storage file adapter 12. Connected to the 70 
input output adapters 11 are a number of 
data entry units 13 : each data entry unit com- 
prises a display screen 14 for passing mess- 
ages to the ticketing clerk and for displaying 
details of the transaction being conducted, a 75 
number of keys or buttons 15 by which the 
clerk can enter data and/ or initiate functions, 
and a printer 16 for issuing tickets when a 
transaction is completed.' 

Attached to the file adapter 12 is a small 80 
data store 17 for storing a portion of the 
data base locally. Data store 17 may, for 
example, be a. floppy magnetic storage disc 
file such as the IBM (Registered Trade 
Mark) Diskette. The size of the store 17 85 
will determine the size of the local data base 
and this size must be weighed against its 
cost. 

Figure 2 illustrates how the data base used 
in the ticketing system shown in Figure 1 is 90 
structured. The data base is built up from 
a number of pages or program segments 
(items) in a tree structure as is described in 
detail in bur Complete Specification No. 
1,437,883. Briefly however, page or pro- 95 
gram segment 21 represents the root of a 
data base for ticketing and from which other 
pages of the data base can be accessed. In 
addition, page 21 can be used to gain ac- 
cess to other pages 22 used to display pages 100 
from an enquiry or reservation data base. 
Page 21 also contains pointers to pages 23 
which are used to display place names, pages 
27 which are used to display dates, and pages 
24 which are used during the display of 105 
city pairs. In turn, pages or program seg- 
ments 24 have pointers to other pages, for 
example, pages 26 which are used to create 
alternative routes when these is a choice of 
route between a particular city pair and 110 
pages 25 which are used to display special 
fares. 

Figure 3 shows the organisation of the 
local subsystem 4. It is envisaged that where 
the controller is an IBM (Registered Trade 115 
Mark) 3773 Model 2 Controller, up to three 
data entry units can be connected to the con- 
troller. Within the controller is a disc file 
17 which consists of a floppy magnetic re- 
cording disc 32 continuously rotating in a 120 
vertical plane as indicated by arrow 33. The 
disc 32 is sandwiched between a magnetic 
recording head carried on arm 34 and a pres- 
sure pad carried on arm 35. The arms 34 
and 35 can be moved along a radius of the 125 
disc 32 as shown by arrow 36 so as to 
access different tracks on the disc 32. When 
a particular track is accessed and informa- 
tion is stored thereon or read therefrom, the 
pressure pad on the arm 35 is moved to- 130 
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wards the disc as indicated by arrow 37 to 
bring the magnetic recording surface into 
contact with the recording head. Such a 
disc store can hold up to 256K bytes of data 
5 in some 77 tracks. 

Also located with the controller is a ran- 
dom access store 31. The store 31 is divided 
into a number of sections, that is display 
buffers 40, printer buffers 41, a page or 

10 program segment buffer 42, and a com- 
munication line buffer 43. Typical sizes for 
these buffers are 2,400 bytes for the display 
buffers 40; 600 bytes for the printer buffers 
41; 600 bytes for the communication line 

15 buffer 43; and 10,000 bytes for the page 
buffer 42. 

Movement of data within the subsystem 4 
is controlled by means of a control unit 30. 
It ! will be appreciated that the control unit 30 

20 can be a purpose-built hardware unit or it 
can consist of general purpose hardware con- 
figured and constrained to operate in a par- 
ticular way by miaoprogpramming tech- 
niques. Using an IBM (Registered Trade 

25 Mark) 3773 Model 2 controller, the control 
unit is configured using read only storage, 
random access storage and microcode in a. 
similar manner to that known in the art. 
Since such microcode etc. does not form part 

30 of the present invention and is well within 
the scope of the competent system designer, 
no details are given within this specification. 

At the heart of the control unit 30 is a 
supervisor 50 whose purpose is to control 

35 the overall operation of the different parts of 
the system. Data being read to and from 
disc file 17 on line 39 are organised and 
controlled by means of a disc manager 38 
which in turn is controlled by the supervisor 

40 50 through line 56. In a similar manner, 
signals on line 51 from the buttons or keys 
15 of the data entry unit are interpreted 
by a data entry unit manager 44 which in 
turn is supervised by the supervisor 50 

45 through line 57. The data entry unit man- 
ager 44 also has the function of assembling 
data to be displayed within the display buffer 
40 along line 52. Data within the buffer 40 
are transmitted along line 68 to the displays 

50 14. 

Similarly, data to be printed in printer 16 
are transmitted along line 69 from the print 
buffer 41 where they are assembled, via line 
53, under control of a print manager 45 

55 in turn supervised by the supervisor through 
line 58. As was indicated earlier, com- 
munication with the host processor ^ is 
through the communication buffer 43 which 
is loaded through line 55 by a communica- 

60 tion line manager 49. Under control of the 
supervisor 50 through line 63, communica- 
tion line manager 49 ensures that data to be 
transmitted to the host processor or data 
received from the host are correctly for- 

65 matted and synchronized. 



As will be described in more detail later, 
the bulk of the storage space on magnetic 
disc 32 is occupied with items or pages (pro- 
gram segments) from the data base. It also 
contains records of transactions which have 70 
been conducted on each terminal. As an 
alternative, a magnetic cassette recorder 65 
could be used to record journal entries and 
similar sorts of information in which case 
the recorder 65 could be controlled by a 75 
cassette manager coming under the overall 
supervision of the supervisor 50 through line 
67. The cassette recorder could be used in- 
stead of the disc file 17 to store pages but 
generally would be too slow for such a pur- 80 
pose. 

As was indicated earlier, pages or program 
segments can be stored within the disc drive 
17 or within the page buffer 42. Pages are 
loaded into and out of page buffer 42 along 85 
line 54 under the control of a page manager 
47 supervised along line 60 by the super- 
visor 50. Pages read from the buffer 42 are 
interpreted by a page interpreter 48 con- 
nected to the page manager 47 and the super- 90 
visor 50 by lines 61 and 62 respectively. As 
is now common with data processing equip- 
ment, the supervisor 50 can access a diagnos- 
tics unit 46 for utilizing diagnostics programs 
along line 59 when a fault develops within 95 
the subsystem: this aids a service engineer 
in determining which unit or component of 
the subsystem is faulty. 

The operation and piupose of the various 
parts shown in Figure 3 are described in the 100 
Complete Specification of our co-pending 
Application for Letters Patent No. 10814/76 
(Serial No. 1,504,113) which describes the 
use of an area gazetteer to access selected 
city-pair pages. As was explained earlier 105 
with reference to Figure 2, there are within 
the data base a very large number of pages 
which define city-pairs. Thus with 'V 
cities or stations in a transport system, there 
are n(n — 1)/2 possible city-pair combina- 110 
tions. Thus in order to issue a ticket for a 
journey between any origin and any des- 
tination a terminal must be able to locate 
any one of the n(n — 1)/2 combinations. The 
speed with which a particular ticket can be 115 
generated will depend essentially on whether 
or not and where the corresponding city-pair 
page is stored locally or whether there is a 
need to access the data base at the central 
host processor. Storing all the city-pair pages 120 
locally is impractical since to do so would 
require too much local storage. 

As was explained in the Complete Speci- 
fication of our co-pending Application for 
Letters Patent Serial No. 1,437,883, a num- 125 
ber of subsets of the data base can be stored 
locally. If these subsets contain the most 
popular city-pair combinations for that par- 
ticular station, most tickets can be issued by 
accessing the local subset. However, the 130 
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number of stations in a local subset is typic- 
ally about 50 so provision must be made for 
accessing other city-pair pages quite quickly. 
Figure 4 is a view showing the display 
5 and data entry unit. The display 14 is di- 
vided into four areas. Area 101 allows 
ticket information to be displayed so that 
the ticketing clerk can control which ticket 
will be printed when he presses ticket but- 
10 ton 100. The buttons 15 at the top of the 
screen 14 labelled LSI, LS2, LS3, LS4 and 
BS allow the clerk to display four local sub- 
sets of the data base and also allow him to 
backspace to the root of the data base. The 
15 remaining buttons or keys 15 are distributed 
around the screen 14. Their function is vari- 
able and the particular function they re- 
present is displayed on an adjacent area of 
the screen. Thus area 102 is used to label 
20 the left-hand keys 15, area 103 is used to 
label the right-hand keys 15, and area 104 is 
used to label the bottom keys 15. 

The use of the data entry unit' and display 
to interrogate the data base is described in 
25 detail in the Complete Specification of our 
aforementioned patent applications. 

As will be explained in more detail later, 
if a particular journey can be created from 
the local subset, there is no need to access 
30 the full data base at the host processor. The 
embodiment to be described is aimed at dis- 
tributing the data base so that the need for 
accessing the host is minimized. Not only 
does this reduce communication costs but 
35 it also ensures that a larger proportion of 
tickets can be issued even if the link to the 
host is defective. 

Fieure 5 is a schematic which summarizes 
howlhe data base is distributed. Displays in 
40 the data entry unit 13 are controlled from 
pages stored in the random access memory 
31. The host store 2 contains the full data 
base including all city-pair pages and a full 
gazetteer. The gazetteer serves as a direc- 
45 tory of city names to enable access to be 
made to pages representing particular city- 
pair combinations. Typically the full 
gazetteer may contain some 3,000 to 3,500 
names which require some 200 pages in the 
50 data base. With 3,000 names, there are 
4,498,500 possible journeys requiring some 
4.5 million pages. Assuming that each city- 
pair page contains 20 bytes this means that 
the data base would require about 90 million 
55 bytes just for city-pair pages. Clearly it is 
impractical to store all these in the local disc 
store 17 and so only the more popular pages 
are stored in the disc store 17. Certain tracks 
17A are retained for an area gazetteer and 
60 selected city-pair pages which constitute the 
local subsets selected by the ticketing staff. 
These tracks are protected against deletion. 
Tracks 17B on the other hand contain pages 
which are fetched from the host but which 
65 are retained in storage in accordance with 



their frequency of use. Typically, the IBM 
(Registered Trade Mark) Diskette floppy 
magnetic disc can be arranged to store data . 
in 77 tracks. Each of the 77 tracks contains 
26 blocks of data, each block being con- 70 
stituted by 128 bytes. With the local sub- 
sets and area garetteer requiring some 8 
tracks up to 47 tracks may be made avail- 
able for storing pages fetched from the host 
data base. This means that up to 1,000 75 
city-pair pages, in addition to fare calculation 
and control pages, can be fetched and stored 
within tracks 17B. 

Apart from containing pages actually be- 
ing used to create the display, the random 80 
access memory also contains the most re- 
cently or frequently used pages including 
pages in the local subset. It has been esti- 
mated that, with such an arrangement out- 
lined above, up to 90% of all ticket requests 85 
can be handled directly from the random ac- 
cess memory, and up to 99% can be handled 
without requiring access to the host. 

One data storage management technique 
will now be described with reference to 90 
Figures 6 to 8. Figure 6 is a flow chart 
illustrating the functions performed by the 
supervisor 50, Figure 3, the page manager 
47, Figure 3, and the disc manager 38, Figure 
3. When the supervisor 50 determines that 95 
a page or program segment is required it re- 
quests the page manager 47 at 70 whether or 
not that particular page is in the random 
access memory 31. If the required page is 
in RAM 31, then it is accessed as at 71 for 100 
display or calculation. 

If the required page is not in RAM 31 the 
supervisor 50 determines at 72 whether the 
page is in a look-aside table stored in RAM 
which contains the address on the magnetic 105 
disc 32 of the most recently or most fre- 
quently used pages. If the address of the 
required page is found in the look-aside 
table, the page is immediately fetched at 73 
by the disc manager 38 from the disc 32 and 110 
written into RAM 31 as at 77 : the page can 
then be accessed as at 71. 

If on the other hand it was determined at 
72 that the page address was not in the 
lookaside table, it is necessary to determine 115 
whether the page is stored on the disc or 
whether access to the host is necessary. A 
preferred technique is to perform a hashing 
operation on the number of the page re- 
quired as at 74 and from this determine in 120 
which track or group of tracks on the disc 
32 the page will be stored if it is present. 
Thus at 75 the track determined from the 
hash operation would be accessed and a de- 
termination would be made using the track 125 
directory to determine whether the page re- 
quired was actually present If the page is 
stored it can be fetched by the disc manager 
38 as at 73. In this case the lock-aside table 
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is updated, the new entry replacing that of 
the least recently fetched page. 

If it is determined that the required page is 
not stored in the disc store 17, the super- 
5 visor 50 causes the communication manager 
49 to fetch the required page from the host 
as at 76. When the page is received from 
the host, it is immediately written into RAM 
31 as at 77 for subsequent use. Simul- 

10 taneously, the supervisor 50 causes the disc 
manager 38 to determine whether there is 
storage space in the appropriate track (if 
hashing is used) on the disc 32 as at 78. If 
there is sufficient space on the disc 32, the 

15 fetched page is written into the disc store 17 
as at 79 by the disc manager 38. If there is 
insufficient space in the disc store 17 to 
store the fetched page, then space is created 
by deletion as at 80, of the least frequently 

20 used pages provided these pages are not pro- 
tected. 

Thus when a page is required, the super- 
visor first causes the page manager 47 to 
determine whether the page is in RAM 31 

25 then if necessary causes the disc manager 38 
to determine whether the page is stored in 
the disc store 17, and then if necessary 
causes the communications manager 49 to 
fetch the page from the host. In a modifica- 

30 tion if it is determined that the required 
page is not in RAM 31, access can be initi- 
ated to the host as indicated by 81 whilst it 
is determined whether the page is in the disc 
store 17. The supervisor 50 in this case 

35 would cancel the request to the host if the 
page is first fetched from the disc store 17 or 
would cancel the request to the disc store 17 
if the page is first fetched from the host This 
would prevent the accumulation of delays 

40 which might occur should the host not be 
accessed until after it had been determined 
that the required page was not stored locally. 

Figure 7 schematically illustrates how 
pages are brought into and deleted from the 

45 random access memory. When the page 
manager 47, Figure 3, fetches a page for 
storage in the RAM, it first determines its 
position in the data base tree-structure. The 
position of the page above or below a thres- 

50 hold level determines whether the page is 
added to a least-recently-used (LRU) chain 
or to a quite-recently-used (QRU) chain. 
Pages at a level below the threshold are 
added to the LRU chain and pages above 

55 the threshold are added to the QRU chain, A 
minimum number of pages can be specified 
for the QRU chain. Each chain operates as 
a push down stack with the most recently 
used page at the top of the stack. 

60 When the page buffer is full, it is neces- 
sary for space to be created by deletion of 
least-frequently-used pages and the algorithm 
illustrated in Figure 71 can be used for this 
purpose. 

65 Firstly, the page manager 47, Figure 3, 



determines at 90 whether there is sufficient 
space in RAM for the new page. If there is, 
the page manager writes the new page into 
RAM as at 91 chaining it to the appropriate 
stack, that is QRU or LRU. If however the 70 
determination at 90 is that the RAM is full 
and deletion is required to create space, the 
page manager determines at 92 whether the 
QRU chain contains the minimum number 
of pages. If it does not then the least re- 75 
cently used page in the QRU chain is de- 
leted as at 93. When sufficient space has 
been created in this way, the new page can 
be stored in RAM at the top of the appro- 
priate stack, that is QRU or LRU. If, how- 80 
ever, it was determined at 92 that the QRU 
chain contained the minimum number of 
pages, the page manager deletes the least 
recently used page from the LRU chain as 
at 94. 85 

The use of this algorithm and the fact 
that the most frequently used pages statis- 
tically will be at or near the root of the tree 
or sub-trees, means that in operation the 
most frequently used pages will tend to be 90 
concentrated towards one end of the page 
buffer. 

Figure 7H is a schematic of the random 
access memory 31 illustrating how various 
pages 95 will be randomly distributed 95 
through the RAM. The arrows in Figure 
7H represent pointers from one page in the 
chain to the next page in the chain. 

Figures 7A to 7G serve to illustrate how 
pages are brought into and out of the QRU 100 
and LRU stacks. It should be noted that 
Figures 7A to 7G do not represent physical 
positions with the page buffer but represent 
the logical positions within the QRU and 
LRU stacks. 105 

Figures 7A to 7F represent the two chains 
during 7 transactions. As represented by 
Figure 7A, some five pages have been in- 
serted into the stack. The first two pages 
fetched, Pill and P222 each have a position 110 
in the tree structure below the threshold level 
for this particular transaction and accord- 
ingly have been entered in the LRU chain. 
The third, fourth and fifth pages however are 
at a level in the data base tree structure for 115 
this particular transaction above the thresh- 
old level and have been insered into the 
QRU chain. 

For the next transaction, represented by 
Figure 7B, a further five pages P100, P200, 120 
P300, P400 and P500 have been fetched in 
that order. Pages P100 and P200 are below 
the threshold level and accordingly have been 
added to the LRU chain. Pages P300, P400 
and P500 are above the threshold and belong 125 
to the QRU chain. 

For the third transaction, represented by 
Figure 7C, assume that only two pages need 
to be fetched, namely PI 10 and P210, the 
first below and the second above the thresh- 130 
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old level. This fills the page buffer and to 
add any new pages will require deletion of 
old pages, normally from the QRU stack to 
create sufficient space. * 

5 In the next transaction, Figure 7D, assume 
that four pages are required, namely P101 
and P202 determined to be below the 
threshold level and P303 and P404 deter- 
mined to be above the threshold level. In 

10 this case pages P300, P555, P444 and P333 
will be deleted from the QRU chain by the 
page manager. 

As was indicated above, a minimum num- 
ber of pages can be specified for the QRU 

15 chain and assume that this minimum number 
is 5. As shown in Figure 7D the QRU chain 
now contains this minimum number. If the 
next transaction were to fetch three pages of 
such a size that pages P210, P500 and P400 

20 can be deleted from the QRU chain to create 
sufficient space whilst still meeting the mini- 
mum requirement then these pages will be 
deleted. If on the other hand the page 
manager determines that the new pages re- 

25 quire so much space in the QRU chain that 
the minimum number condition can no 
longer be fulfilled if pages are deleted from 
the QRU chain, the least recently used page 
or pages in the LRU chain is or are deleted. 

30 This is illustrated in Figure 7E where two 
new pages P330 and P440 have been fetched 
and inserted into the QRU chain and pages 
P220 and P140 have been added to the LRU 
chain. To meet the requirement for a mini- 

35 mum number of pages in the QRU chain 
only pages P500 and P400 have been deleted 
from the QRU chain : pages at the bottom 
that is the least recently used pages of the 
LRU chain also having been deleted to 

40 create space. Pages can vary in the amount 
of data they contain and this is represented 
schematically by showing certain pages as 
occupying more space than others, for ex- 
ample pages P330, 440 and 220. 

45 Figure 7F illustrates the situation where 
the transaction shown in Figure 7A is re- 
peated. To maintain the minimum number 
of pages in the QRU chain, pages are deleted 
from the LRU chain. Thus pages PI 1 1, P222, 

50 P333, P444 and P555 are fetched and inserted 
into the QRU and LRU chains whilst pages 
P404, P303, P210, P202 and P101 have been 
deleted. 

Figure 7G represents the schematic where 
55 the transaction of Figure 7B has been re- 
peated. Thus pages P500, P400 and P300 
have been fetched and inserted in the QRU 
chain with pages P200 and P100 inserted 
into the LRU chain. Sufficient space has been 
60 created by deleting pages from the QRU 
chain only. It is emphasized that Figure 7 
shows the logical positions in the chains as 
controlled bv the page manager 47 and does 
not represent the actual physical position of 
65 the pages within the page buffer 42. As pages 



are deleted from the buffer 42 to create space 
for new pages, the page manager 47 con- 
solidates the space in the buffer so that the 
available space is not physically fragmented 
through the buffer. The combined effect of 70 
this consolidation technique and the QRU 
and LRU chain technique is to pack the 
frequently used pages at one end of the page 
buffer: the lower level pages are used more 
frequently than the higher level pages. 75 

Figure 8 illustrates one technique for man- 
aging the storage on the disc file 17. The 
IBM (Registered Trade Mark) diskette con- 
tains up to 77 tracks. Figure 8 A illustrates 
how the various tracks are allotted to store 8Q 
different sorts of data. Thus tracks 1 to 4 are 
used to provide a copy of certain data in the 
random access memory RAM. Such data 
include a table containing the addresses on 
the disc of the 32 most frequently fetched 85 
pages. The table size depends on the access 
pattern of the systems but typically may con- 
tain 32 entries. This most frequently fetched 
table serves as a look-aside table as was 
explained with reference to Figure 6. Also 90 
stored in RAM and the back up tracks is a 
track-use table whose purpose is to keep a 
record of protected tracks, tracks in need of 
compression and also a record of the fre- 
quency of access or use of the different 95 
tracks. Thus when space needs to be created 
within the disc file, the disc manager can 
access the track-use table to determine which 
unprotected track is least frequently used: 
space can then be created on this track. 100 

Tracks 5 to 13 are reserved for pages con- 
stituting the local subsets and area gazetteer. 
As such they are protected from deletion 
during the normal space creating process 
although they can be replaced if a new local 105 
gazetteer or new local subsets are created. 
The full gazetteer and city-pair pages are 
divided into four groups A, B, C and D. Any 
pages from group A are stored in tracks 14 
to 20, any from group B in tracks 24 to 29, 110 
any from group C in tracks 47 to 54, and 
any from group D in tracks 59 to 67. Since 
the pages are of variable size, it is convenient 
to have overflow tracks 21 to 23 for groups 
A and B and overflow tracks 55 to 58 for 115 
groups C and D. 

Tracks 30 and 39 and 41 to 46 are reserved 
for journal tracks : these contain entries rer 
cording which transactions have been com- 
pleted at the different ticketing terminals con- 120 
nected to the local processor. Track 40 
contains a record of the current cash balance 
for each terminal together with the serial 
number of the current transaction. Tracks 
68 to 77 are reserved tracks. 125 

It should be noted that the IBM (Regis- 
tered Trade Mark) diskette is in contact with 
the recording/playback head when it is 
being accessed. Therefore frequently accessed 
tracks can cause undue wear of the diskette. 130 



7 



1,504,112 



7 



Such wear can be equalized by periodically 
re-distributing the tracks over the disc sur- 
face. Thus the track numbers in Figure 8A 
do not necessarily reflect the physical posi- 
5 tion of the tracks on the disc. 

Each of the 77 tracks contains 26 blocks 
of data, each block being constituted by 128 
bytes. The beginning of each track used for 
page storage contains a three block directory 

10 which indicates the position within the tracks 
of the pages stored therein, Figure 8B. The 
exact format of the directory can be of any 
suitable form but preferably the first eight 
bytes of the 3-block directory constitute a 

15 directory heading containing, for example, 
the number of directory entries in use, the 
next block available for storage and the 
starting point within that block, an indica- 
tion of whether an overflow area is in use 

20 and whether the track is a protected track. 
The directory entry for a page would norm- 
ally contain, for example, the page number 
(for example 32 bits), the address of the 
page, whether it is protected, and a count of 

25 its frequency of use. 

When a page is written onto the disc, the 
page number is hashed to determine in which 
group of tracks (or alternatively which track) 
it should be stored. Thereafter when the disc 

30 manager determines from the track-use table 
which track is least frequently used, it can 
determine from the track directory which 
blocks are least frequently used within that 
track. 

35 The use counts in the track use table 
and the track directory are given a high 
value when a page is written into the disc 
file. Periodically the count is decremented, 
for example, once daily or at the beginning 

40 of each shift, and is restored to the high 
count whenever the page is used. 

Periodically, pages stored on the disc file 
can be rewritten so as to consolidate the free 
space and prevent undue fragmentation of 

45 space. 

The storage management techniques des- 
cribed with reference to Figures 6 to 8 are 
particularly useful where the frequency of 
access to the pages or program segments is 

50 biased such that some pages are more fre- 
quently accessed than others. The alternative 
techniques to be described with reference to 
Figures 9 to 14 are useful where the access 
pattern of the pages is more random. 

55 Figure 9 illustrates an alternative technique 
to that shown in Figure 6 for accessing a 
page of the data base. If the page manager 
47, Figure 3, indicates at 70 to the superviser 
50, Figure 3, that the required page is not 

60 in the random access memory, a determina- 
tion is made at 72 1 to determine whether the 
required page is stored in the disc directory. 
As will be explained in more detail with 
reference to Figures 11 and 13, the disc 

65 directory is stored in two tracks on the disc 



file. If the disc manager 38, Figure 3, deter- 
mines that the required page is stored in the 
disc file, it is fetched and written into RAM 
(steps 73 and 77) in a similar manner to 
Figure 6. If the page is not in the disc direc- 70 
tory, it is fetched from the host as at 76 
and subsequently written into RAM and on 
to the disc (steps 77 to 79) in a similar 
manner to Figure 6. 

To prevent accumulative delays the super- 75 
visor 50 may control the communication 
manager 49 to fetch a page as indicated by 
81 1 as soon as it has been determined that 
the desired page is not in RAM. The super- 
visor 50 would then monitor which of the 80 
disc manager 38 or the communication man- 
ager 49 fetched the required page first and 
then cancel the request to the other. 

Figure 10 schematically illustrates how the 
page buffer 42 may be organized. The buffer 85 
42 is divided into two sections, one contain- 
ing a directory and one containing pages and 
free space for pages. The boundary between 
the two buffer sections is floating since to 
have a fixed boundary would waste space. 90 
Figure 12 is an example of a possible direc- 
tory entry format consisting of 10 bytes. 
The first four bytes represent the page num- 
ber, the fifth byte represents the extent or size 
of the page, and the sixth byte indicates the 95 
type of page, for example whether the page 
is used to create a display or whether it is 
used for calculation purposes only. 

Bytes 7 and 8 contain the address in RAM 
of the page, byte 9 contains a count repre- 100 
senting the amount of use of the page and the 
tenth byte represents the status. There are 
two ways in which the count can be used to 
indicate the use of the page. In a first 
method, the count is given a high value when 105 
the page is first written into RAM and is then 
periodically decremented, the high count 
being restored whenever the page is used. 
In a second method, the count is incremented 
each time the page is use: when the count 110 
field (byte) is full the page manager auto- 
matically divides all count fields in the direc- 
tory by 2 (that is shifts the bits within the 
byte one position to the left dropping the 
least significant bit). 115 

Directory entries are entered into the 
buffer 42 from the top represented by arrow 
DE in Figure 10) and pages are entered into 
buffer 42 from the bottom (represented by 
arrow PE in Figure 10). Figure 10A repre- 120 
sents the situation at initialization when the 
page buffer is completely empty, the space 
being headed by a space pointer SP. The 
space pointer contains an indication of the 
extent of the space it heads and also the 125 
address in the buffer of the next space, if any. 

Figure 10B represents the situation where 
pages and their corresponding directories 
have been inserted in the buffer 42 as they 
are fetched from the host or the disc file. 130 



8 



1,504,112 



8 



Eventually, as shown in Figure IOC, the 
buffer will be completely filled with pages 
and their directory entries. When a new 
page has to be added to the buffer, space 

5 must be created by deleting infrequently 
used pages. There are two possibilities. Either 
the page manager can determine which page 
has the lowest frequency of use count and 
cause it to be deleted or the page manager 

10 can delete all pages having frequency of use 
counts below a threshold value. Preferably 
the page manager scans the directory exam- 
ining the frequency of use byte. If its value 
is below the threshold value, its status byte 

15 is examined to determine whether it can be 
dropped. If the extent is equal to or greater 
than the required space, this page becomes 
the most eligible for deletion and is dropped. 
If its extent is not sufficient, scanning of the 

20 whole directory proceeds until either a suffi- 
ciently large page suitable for deletion is 
obtained or the whole directory has been 
scanned. 

If there is insufficient space created by 
25 dropping one page, the page manager will 
delete more than one. As soon as a page 
(and its corresponding directory entry) is 
deleted, a space pointer is inserted into the 
space thereby created. If the space is only 
'30 partially filled with the new page, a new 
space pointer is inserted at the head of the 
remaining space. This is illustrated in Figure 
10D where pages have been inserted into 
two spaces. 

35 Periodically, to prevent excessive fragmen- 
tation of the space through the buffer, the 
space can be consolidated between the direc- 
tory area and the pages. Consolidation of 
space is easier if undertaken from the direc- 

40 tory end of the page buffer. Accordingly each 
space pointer preferably has within it the 
address of the space immediately preceding 
it as well as that of the space immediately 
following it. Figure 10E represents the situa- 

45 tion where the space has been consolidated. 
Although not essential, at this time the 
page manager, can also rearrange the pages 
into frequency-of-use order. 
Figure 11 is similar to Figure 8A but 

50 illustrates an alternative technique for allo- 
cating pages to the different track of the 
magnetic disc file. Tracks 1 to 4 are allocated 
to store the random access memory backup 
(with or without a track use table or a table 

55 of the most recently used pages). Tracks 5 to 

13 are allocated to store the local subsets 
and the area or local gazetteer whilst tracks 

14 to 30 are designated for storing the jour- 
nal and balance. Two tracks, tracks 31 and 

60 32, are assigned to contain the disc directory 
with the tracks 38 to 77 being used to store 
pages. Tracks 33 to 37 are reserved for 
other purposes. As in the first embodiment 
the most frequently used pages are retained 

65 in storage in disc file 17. It should also be 



noted that in order to distribute wear evenly 
over the disc in the case of in contact re- 
cording and playback, the various tracks 
and especially the journal, balance and direc- 
tory tracks can periodically be rewritten in 70 
different physical positions on the disc. The 
track numbers in Figure 11 do not therefore 
represent actual track positions. 

As in the earlier embodiment each track 
contains 26 blocks of 128 bytes each. How- 75 
ever in contrast to the earlier embodiment, 
the directory entries are not inserted at the 
begining of each track but are located in two 
dedicated directory tracks which will be des- 
cribed with reference to Figures 13 and 14. 80 
Figure 13 shows the format of the directory 
entry whilst Figure 14 shows an example of 
how entries are^made in the directory tracks. 

Referring now to Figure 13, there are two 
types of 8 byte directory entries. The first 85 
type, called a primary type is shown in 
Figure 13A with the second type, called a 
secondary type, being shown in Figure 13B. 
As will be seen from Figure 13A, the first 
four bytes of a primary entry are used for 90 
the page number with the next four bytes 
being used to indicate the size of the page, 
the type of directory entry, a count indicat- 
ing the frequency of use of the page and an 
overflow pointer pointing to the continua- 95 
tion (if any) of the page. The purpose of 
these bytes will become clearer later. As will 
be seen from Figure 13B, a secondary entry 
also comprises 8 bytes but the first four, the 
fifth and the seventh bytes are not used. The 100 
sixth byte is used to indicate that the entry is 
a secondary type while the eighth byte is 
used as a pointer to any continuation of the 
field. 

In this embodiment, the 40 tracks used for 105 
city-pairs and other pages from the host are 
each divided into 26 blocks. Thus there are 
1,040 blocks available for storage of pages. 
One paee can occupy from 1 to 8 blocks, 
that is 128 to 1,024 bytes. Each of the direc- 110 
tory tracks is divided into 1,040 sectors 
(40 tracks X 26 blocks per track). Thus the 
directory contains as many entries as there 
are blocks for storing pages. By partitioning 
the page storage area into fixed length 115 
blocks, fragmentation of space is avoided and 
time consuming space consolidation becomes 
unnecessary. Each directory entry, whether 
it is primary or secondary, contains a for- 
ward chaining pointer (OFLO) which links 120 
together all the blocks which comprise a 
particular page. Hie OFLO pointer is also 
used to manage unused space. 

The directory entries are written on two 
tracks A and B. Track A contains the search 125 
fields (that is page number) while Track B 
contains the associated argument fields con- 
taining extent, type, frequency of use counts, 
and OFLO pointer. Each directory track will 
therefore contain 26 blocks of 40 four-byte 130 
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directory entry data. The directory order is 
maintained on both tracks. Thus the nth 
search field (containing tie name of the page) 
in track A and the nth argument field in 
5 track B constitutes a complete directory entry 
for the nth block in the page storage tracks. 
The address of any page can be calculated 
from the position of its directory entry. For 
example, if the required page number if 

10 found in the tenth directory entry, then the 
address of the page is found by adding 10 
to the track/block address of the start of the 
page storage blocks. 
The OFLO pointer allows segmenting of 

15 pages which are larger than 128 bytes. It can 
contain either the relative or die absolute 
address of the directory entry and block for 
the next segment of the page. The secondary 
type of directory entry is used for overflow 

20 entries and allows the disc manager to dis- 
tinguish between pages which are more than 
one block (128 bytes) long. Furthermore it is 
useful during deletion to ensure that only 
primary entries or primary entries and their 

25 associated secondary entries are deleted. 
Since the search field of a secondary entry 
contains a null key, it can be ignored during 
a search for a particular page number. 
The use field in the directory entry is used 

30 in a similar manner to that described pre- 
viously. When space is required, the disc 
manager searches the directory to determine 
which primary entry has a low use count. 
The associated page can then be deleted, any 

35 associated secondary entries being chained 
through their OFLO pointers to a free space 
pointer contained, for example, in the first 
available position in the directory. 
Because the floppy diskette employs 

40 recording/playback head contact during 
accessing of a track, iv can be expected that 
error rates in the directory tracks will in- 
crease due to their heavy use. This problem 
can be mitigated by periodically, for ex- 

45 ample, once daily, rewriting the paired direc- 
tory tracks on to another area of the disc, 
for example, in tracks 33 to 37. 

Figure 14 shows an example of directory 
entries where the first entry is a free space. 

50 The disc manager contains a free space 
pointer which points to the first available 
free space on the disc. It does not necessarily 
have to be located in the first block. 
In Figure 14, the OFLO field contains the 

55 relative" address of the overflow directory 
entry although as an alternative the absolute 
address could be used. Thus the first entry 
contains an OFLO pointer indicating that 
further free space will be located at directory 

60 entries 9(1+8) and thence at entries 11, 12, 
etc Entry 2 is a primary directory entry for 
page number AAAA. This page occupies 
two to three blocks (between 256 and 384 
bytes) and as shown the OFLO pointer 

65 points to entry 3 (2+1): this secondary direc- 



tory entry in turn has a pointer to entry 
4 (3 + 1). Thus the three blocks of page stor- 
age corresponding to data entries 2, 3 and 4 
contain page AAAA. It will be seen that the 
OFLO pointer in entry 4 contains a zero 70 
indicating that there is no further overflow. 

Entry No 5 is a primary entry relating to 
page No BBBB which is less than 128 bytes 
in size. Entry No 6 is a primary entry relat- 
ing to page No CCCC which is stored in the 75 
6th and 8th blocks of storage. Pages Nos 
DDDD and EEEE on the other hand are 
each equal or less than 128 bytes in length 
and are stored in the seventh and tenth 
blocks of storage. To summarize, each page 80 
requires 1 primary directory entry and from 
0 to 7 secondary directory entries depending 
on its eize. Each directory entry, primary or 
secondary, corresponds to a storage block 
on the disc file. 85 

What has been described in a ticketing 
system employing various storage manage- 
ment techniques to ensure that die frequently 
used pages are retained in the local store. 
It will be appreciated that the invention is 90 
applicable to any distributed data base sys- 
tem in which a frequency-of-use technique is 
used to determine which pages of the whole 
data base are stored locally. There is, in 
such a system, a considerable difference be- 95 
tween using a frequency of use technique 
rather than a recency of* use technique. The 
latter is used, for example, in programming 
techniques such as virtual storage where the 
most recently fetched pages are held for 100 
some time after use; no attempt is made to 
retain pages in accordance with their fre- 
quency of use. With a distributed data base 
system, it is clearly an advantage to retain 
the most popular pages within the local data 105 
base since these will not correspond neces- 
sarily to the most recently used pages. A 
frequency-of-use technique tends to give a 
more adaptive system (that is more respon- 
sive to changing conditions) than does a 110 
recency of use technique. 

WHAT WE CLAIM IS:— 

1. An interactive enquirey system com- 
prising a host data processor, a central data 115 
store controlled by the host processor for 
storing a data base, a plurality of local sub- 
systems connect ible to the host processor 
and each including a local data processor for 
controlling access to the host processor and 120 
a local data store connected to the local pro- 
cessor for storing a copy of part only of the 
data base, at least one enquiry terminal in 
each subsystem connected to die local pro- 
cessor for accessing any accessible item in 125 
the data base stored in the central store, and 
means in each subsystem for creating space 
in the local store by deleting from the local 
store those items in the data base least fre- 
quendy requested by the terminal or ter- 130 
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minals in that particular subsystem, whereby 
in operation the most popular items re- 
quested in a particular subsystem may be 
accessed without accessing the host processor. 
5 2. An interactive enquiry system as 
claimed in claim 1, wherein each local store 
contains a first group of items anticipated to 
be the most popular or essential items for 
that local subsystem and a second group of 

10 items which in practice are the most popular 
items for that local subsystem, each local 
subsystem including means adapted to pro- 
tect said first group of items from deletion 
from said local store. 

15 3. An interactive enquiry system as 
claimed in either preceding claim, wherein 
each local store comprises a random access 
memory and a magnetic disc file, and where- 
in the most popular items are stored in the 

20 random access memory and the next most 
popular items are stored in die magnetic disc 
file. 

4. An interactive enquiry system as 
claimed in claim 3, in which each of said 

25 random access memories is notionally divided 
into two push-down stacks, comprising means 
in each subsystem for adding an item from 
the data base to one of the stacks in accord- 
ance with its level in the data base structure 

30 as it is accessed. 

5. An interactive enquiry system as 
claimed in claim 3, in which each random 
ccess memory is divided ino a first part for 
storing items accessed from the data base 

35 and a second part for storing a directory 
entry for each item stored within the first 
part, each directory entry containing an 
indication of the frequency of use of its 
associated item, comprising means in each 

40 subsystem operable when the first part has 
no space for a newly accessed item to scan 
the directory entries fo determine the least 
frequently used item or items and adapted 
to delete the least frequently used item or 

45 items to create sufficient space for the newly 
accessed item. 

6. An interactive enquiry system as 
claimed in claim 5, in which the random 
access memory is so arranged that the 

50 boundary between the first part and the 
second part is floating. 

7. An interactive enquiry system as 
claimed in claim 5 or claim 6, comprising 
means for periodically consolidating free 

55 space within the first 1 part. 

8. An interactive enquiry system as 
claimed in any one of claims 3 to 7, wherein 
each subsystem comprises means adapted to 
store items in selected tracks on said mag- 

60 netic disc file and to store on said disc file 
associated directory entries which include an 
indication of the frequency of use of their 
associated items, means for searching said 
disc file directory entries whenever a re- 

65 quested item is not found in said random 



access memory to determine whether the 
requester item is stored on said magnetic 
disc, and means operable when the disc file 
has no space for a newly accessed item to 
scan said disc file directory to locate a least 70 
frequently used item or items and to delete 
the least frequently used item or items to 
create sufficient space for the newly accessed 
item. 

9. An interactive enquiry system as 75 
claimed in claim 8, wherein each subsystem 
comprises means for storing a directory entry 
associated with a particular item on the 
same track as that particular item. 

10. An interactive enquiry system as 80 
claimed in claim 9, wherein each subsystem 
comprises means for performing a hashing 
operation on a number identifying an item 

to determine in which track or group of 
tracks that item should be stored. 85 

11. An interactive enquiry system as 
claimed in claim 8, wherein each magnetic 
disc file includes a plurality of dedicated 
tracks for storing items and divided into a 
plurality of storage blocks and one or more 90 
dedicated directory tracks containing a plur- 
ality of sectors corresponding in number to 

the number of storage blocks, each directory 
sector containing an entry identifying the 
item stored in the corresponding storage 95 
block. 

12. An interactive enquiry system as 
claimed in claim 8, wherein each disc file 
includes a pair of directory tracks each 
divided into a plurality of sectors equal in 100 
number to the number of storage blocks, a 
directory entry for each block being con- 
tained partly in one of the pair and partly 

in the other of the pair. 

13. An interactive enquiry system as 105 
claimed in any one of claims 3 to 12, wherein 
each subsystem comprises a list of the most 
recently used items together with their ad- 
dress within the disc file, and means oper- 
able when an item is not found in the ran- 110 
dom access memory to scan the list to deter- 
mine whether the required item is stored in 

the disc file. 

14. An interactive enquiry system as 
claimed in any one of claims 3 to 13, wherein 115 
each subsystem comprises means operable 
when a required item is not within the 
random access memory for simultaneously 
accessing the disc file and the central store 
and adapted to cancel the request to one 120 
store when the item is received from the 
other store. 

15. An interactive enquiry system as 
claimed in any preceding claim, wherein 
each subsystem comprises means for associ- 125 
ating a frequency of use count with each 
item stored in the local store, and means for 
increasing the count associated with a par- 
ticular item whenever that item is accessed. 

16. An interactive enquiry system as 130 
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claimed in any one of claims 1 to 14, where- 
in each subsystem comprises means for as- 
sociating a frequency of use count with each 
item stored in the local store, means for 
assigning a high count whenever the par- 
ticular item is accessed, and means for 
periodically decreasing the count for all items 
stored in the local store. 

17. An interactive enquiry system as 
claimed in any preceding claim in the form 



of a transport or seat ticketing and/or reser- 
vation system. 

10. An interactive enquiry system, sub- 
stantially as herein described with reference 
to the accompanying drawings. 15 

JOHN BLAKE, 
Chartered Patent Agent, 
Agent for the Applicants. 
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